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ABSTRACT 

Measurements of galaxy clustering are now becoming possible over a range of redshifts 
out to z ~ 3. We use a semi- analytic model of galaxy formation to compute the 
expected evolution of the galaxy correlation function with redshift. We illustrate how 
the degree of clustering evolution is sensitive to the details of sample selection. For 
a fixed apparent magnitude limit, galaxies selected at higher redshifts are located 
in progressively rarer dark matter haloes, compared with the general population of 
galaxies in place at each redshift. As a result these galaxies are highly biased tracers of 
the underlying dark matter distribution and exhibit stronger clustering than the dark 
matter. In general, the correlation length measured in comoving units, decreases at 
first with increasing redshift, before increasing again at higher redshift. We show that 
the e-model often used to interpret the angular correlation function of faint galaxies 
gives an inadequate description of the evolution of clustering, and offers no physical 
insight into the clustering process. We compare our predictions with those of a simple, 
popular model in which a one-to-one correspondence between galaxies and dark halos 
is assumed. Qualitatively, this model reproduces the correct evolutionary behaviour at 
high redshift, but the quantitative results can be significantly in error. Our theoretical 
expectations are in good agreement with the high redshift clustering data of Carlberg 
et al. and Postman et al. but are higher than the measurements of Le Fevre et al. 
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1 INTRODUCTION 

In general, the clustering properties of galaxies are expected 
to vary with cosmic time. The nature of this evolution de- 
pends on factors such as the values of the cosmological pa- 
rameters, the identity of the dark matter and the details 
of the galaxy formation process. Traditionally, attempts to 
infer the evolution of galaxy clustering have relied on mea- 
surements of the angular correlation function of faint galax- 
ies, ui(9), as a function of apparent magnitude (some recent 
examples include Efstathiou et al. 1991; Roche et al. 1993; 
Infante & Pritchet 1995; Hudon & Lilly 1996; Brainerd & 
Smail 1998; Postman et al. 1998). The spatial correlation 
function, £(r), may be inferred from the diluted signal in 
projection by inverting Limber's (1954) equation given as- 
sumptions about the redshift distribution of the galaxies and 
the evolution of their clustering. A simple model of the evo- 
lution of the two-point spatial correlation function, the 'e- 
model' (Groth & Peebles 1977), has often been used to in- 
terpret this kind of data (Efstathiou et al. 1991). Recently, 
it has become possible to measure the spatial correlations 
directly in deep spectroscopic surveys (Le Fevre et al. 1996; 
Giavalisco et al. 1998; Carlberg et al. 1999). 



The main difficulty in interpreting measurements of 
galaxy clustering is the uncertain relation between the ob- 
served galaxies and the underlying mass distribution. This 
problem is compounded by the fact that the properties of 
the galaxies selected according to any simple criteria (eg a 
magnitude limit) are likely to vary over a long redshift base- 
line. Thus, the interpretation of the data requires a model 
for galaxy evolution. In this paper we use a semi-analytic 
model of galaxy formation in which the observable proper- 
ties of galaxies are calculated ab initio within a specified 
cosmological model. The semi-analytic model that we use 
is a development of the one described in a series of ear- 
lier papers (Cole et al. 1994; Heyl et al. 1995; Baugh, Cole 
& Frenk 1996a,b) and is fully discussed in Cole et al. (in 
preparation). Models of this kind have been successful in ac- 
counting for many properties of the galaxy population such 
as the shape of the luminosity function, the distribution of 
colours, and the faint counts (White & Frenk 1991; Lacey 
al. 1993; Kauffmann, White & Guiderdoni 1993; Cole et al. 
1994; Somerville and Primack 1998). 

Semi-analytic models have been used to investigate the 
clustering of galaxies (e.g. Kauffmann, Nusser & Steinmetz 
1997; Kauffmann et al. 1998, 1999). An earlier version of the 
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model employed here was used to predict that the 'Lyman- 
break' galaxies discovered by Steidel et al. (1996) at z ~ 3 
should have a comoving correlation length similar to that 
of bright galaxies at the present day (Baugh et al. 1998; 
see also Davis et al. 1985). This theoretical prediction was 
subsequently confirmed by observations (Adelberger et al. 
1998; Giavalisco et al. 1998). Baugh et al. (1998) predicted 
that Lyman break galaxies are hosted by the most massive 
and therefore the rarest dark matter haloes in place at z ~ 
3. Such haloes are biased tracers of the underlying mass 
distribution and are more strongly clustered than the dark 
matter which, in current models of structure formation, was 
much more weakly clustered at z ~ 3 than at present. 



2 MODELLING GALAXY CLUSTERING 

2.1 The e-model for the two-point galaxy 
correlation function 

The correlation function of local, optically selected galaxies 
is well described by a simple power law £(r) = (ro/r) 1 , with 
a slope of 7 « 1.8 and a correlation length of tq ~ 5/i~ 1 Mpc 
(Groth & Peebles 1977) for r < 10/i _1 Mpc. Groth & Pee- 
bles (1977) proposed the "e-model" to describe the redshift 
evolution of the correlation function, measured in terms of 
proper separation: 



£(r,*)=(^) 7 (l+*)-^. 



(1) 



If e = 0, the product of the proper number density of galaxies 
and the correlation function is constant and the clustering 
pattern is fixed in proper coordinates; this is known as stable 
clustering (Peebles 1980, §56). The correlation length, xo, 
can also be written in comoving units as 



xo = ro(l + z) 



-(3+£- 7 )/ 7 



(2) 



where ro = xo(z — 0). The clustering pattern is fixed in 
comoving coordinates ife = 7~ 3»— 1.2, whilst the cor- 
relation function evolves as expected in linear perturbation 
theory, for fi = 1, if e = 7 - 1 « 0.8. 

2.2 The hierarchical clustering of dark matter 

Significant progress has been made in our understanding 
of the dynamical evolution of dark matter in the universe, 
primarily as a result of N-body simulations (e.g. Jenkins et 
al. 1998). The growth of clustering in the dark matter can 
be approximated by a scaling formula that transforms the 
amplitude and scale of a linear density fluctuation into the 
corresponding values for a nonlinear fluctuation (Hamilton 
et al. 1991; Peacock & Dodds 1996). 

The clustering evolution of dark matter haloes is sig- 
nificantly different from that of the underlying dark matter 
(e.g. Cole & Kaiser 1989; Mo & White 1996; Kauffmann, 
Nusser & Steinmetz 1997; Bagla 1998; Matarrese et al. 1997; 
Moscardini et al. 1998). Haloes more massive than the char- 
acteristic mass M» (z) Q initially display stronger clustering 

* The characteristic mass is denned by <t(M») = S c (z), where 
tr(M) is the rms linear density fluctuation at z=0, and S c (z) is 
the extrapolated critical linear overdensity for collapse at redshift 
z - see Lacey & Cole 1993. 



than the dark matter. The clustering of the dark matter 
subsequently grows faster than that of these haloes, reduc- 
ing their bias. The bias is further diluted by the subsequent 
formation of haloes of mass M, when M < M*. The cluster- 
ing evolution of dark haloes sets the scene for understanding 
the clustering evolution of galaxies. 



2.3 Galaxy clustering in a semi-analytic model 

The expected number of galaxies per dark matter halo and 
its associated scatter depend on the details of the galaxy 
formation process (Benson et al. 1999), while the observed 
number per halo depends also on the selection criteria. Semi- 
analytic modelling provides the means to calculate how dark 
haloes are populated by visible galaxies and also to derive 
the expected galaxy correlation function. The latter requires 
the following four steps: 

(i) Obtain the nonlinear correlation function of the dark 
matter distribution using the scaling formula of Peacock & 
Dodds (1996). 

(ii) Select model galaxies according to the observational se- 
lection criteria. 

(iii) Compute an effective bias parameter, b c g (assumed to 
be independent of scale) using equation (^|) or (|^) below, 
weighting each halo by the number of galaxies that satisfy 
the observational selection criteria: 



f b(M)N ga i(M)n(M)dM 
J N gal {M)n{M)AM ' 



(3) 



where, b(M) is the bias parameter of dark matter haloes of 
mass M; N ga i (M) denotes the mean number of galaxies in a 
halo of mass M that satisfy the selection criteria; and n(M) 
is the number density of dark matter haloes of mass M. 
(iv) Obtain the real space correlation function of galaxies 
(ie. free of distortions due to peculiar velocities) by multi- 
plying the nonlinear dark matter correlation function by the 
effective bias parameter squared, £, ga i = ^Cdm- 

The bias parameter for dark matter haloes of mass M is 
given by (Mo & White 1996 ; see also Cole & Kaiser 1989): 



bi(M,z) 



Sc(z)D(z) 



S c (z) 
o{M) 



(4) 



where D(z) is the linear growth factor, normalized to unity 
at the present day. This expression has been tested against 
N-body simulations (Mo & White 1996; Mo, Jing & White 
1996) and works in practice down to the Lagrangian radius 
of the dark matter halo, r ~ Tl, where rz, — (3M/47rpo) 1//3 
and po is the present mean density, even in the mildly nonlin- 
ear regime where £(r) ~ 1. Jing (1998) extended the compar- 
ison to higher resolution simulations and found that equa- 
tion (Q) systematically underpredicts the bias of haloes with 
M/M* <g 1; better agreement is obtained with the slightly 
modified prescription: 



b 2 (M,z)=b 1 (M,z) 



0.5 



(S c (z)/a(M)Y 



(0.06-0.02n) 

II , (5) 



where n is the effective spectral index of the power spec- 
trum, din P(fc)/dln k, at the wavenumber defined by the 
Lagrangian radius of the dark matter halo, k = 2n/rL- 
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Figure 1. The correlation function of galaxies brighter than 
(a) mji = 25.5 and (b) rrin = 23.5, as a function of redshift, 
plotted against comoving separation. The correlation function of 
the dark matter is shown in (c). The correlation functions are 
plotted at z = 0.25 (solid line), z = 1 (dotted) and 2 = 3 (dashed). 
The horizontal dotted line marks £,(xq) = 1 which defines the 
comoving correlation length, xq. 



3 RESULTS 

We consider the clustering of galaxies in a low density, flat 
cold dark matter universe, with present-day density param- 
eter Qo = 0.3, and cosmological constant Ao = 0.7, nor- 
malised to reproduce the observed abundance of rich clus- 
ters (a 8 = 0.94; Eke, Cole & Frenk 1996). Benson et al. 
(1999) show that the correlation function of L > L* galax- 
ies in this model is in good agreement with that measured 
for APM Survey galaxies (Baugh 1996) over separations in 
the range 0.3/i -1 Mpc < r < 15/i -1 Mpc. This agreement is 
not reproduced in critical density models. 

The expected evolution of galaxy clustering for samples 
selected at magnitude limits of tur = 25.5 and ran = 23.5 
is shown in Figs |l|a and Qb respectively. The correlation 
functions are plotted at redshifts of z = 0.25 (solid line), 
1.0 (dotted line) and 3.0 (dashed line). The corresponding 
dark matter correlation functions are shown in Fig 0c. It 
is clear from Fig. 1 that the correlation function of galax- 
ies changes little over the entire redshift range < z < 3. 
The correlation function of the mass, on the other hand, 
grows appreciably between z = 3 (xo = lft _1 Mpc) and 
2 = 0.25 (x = 4.6A _1 Mpc). The markedly different evo- 
lutionary rates of the clustering of galaxies and mass was 
already noted in the first simulations of biased galaxy for- 
mation in a cold dark matter universe (Davis et al. 1985). 

The non-monotonic variation of the correlation length 
(in comoving units) with redshift is shown in Fig. |i[ The 




Figure 2. The comoving correlation length, defined as the sep- 
aration for which £(xq) = 1, plotted as a function of redshift. 
(a). The correlation length of galaxies brighter than mji = 23.5 
(open squares) and that of galaxies brighter than mjj = 25.5 
(open triangles). The filled squares show the comoving correla- 
tion length of the dark matter. The dotted lines show the evo- 
lution expected in the e-model of eqn 2, with e = —1.2,0 and 
0.8, assuming 7 = 1.8 and x (z = 0) = 5.75/i _1 Mpc. (b). The 
effects of dust are demonstrated by the filled triangles for galaxies 
brighter than m^j = 25.5. The dashed line shows the correlation 
length predicted by a model that assumes a one-to-one correspon- 
dence between galaxies and dark halos down to a lower halo mass 
limit, chosen such that the number density of galaxies in the semi- 
analytic model, without dust, is reproduced. The solid line shows 
the result when dust is included. 



top panel displays the evolution of the comoving correlation 
length of galaxies with apparent magnitude brighter than 
mjj = 23.5 (open squares) and = 25.5 (open triangles). 
Only points for which there are fifty or more model galaxies 
in the sample are plotted. The internal errors in our es- 
timate of the correlation length are typically smaller than 
the symbol size, except for the point at z = 4.5 for galax- 
ies brighter than tur < 23.5; the uncertainty in this point 
makes it consistent with the correlation length obtained for 
the fainter sample. The correlation length typically decreases 
at first with increasing lookback time, reaches a minimum 
around redshift 1-2 and then grows again as the selected 
galaxies find themselves in increasingly rare haloes. Similar 
behaviour was seen in a simulation of the same cosmology by 
Kauffmann et al. (1998). The number density of galaxies in 
apparent magnitude selected samples falls rapidly with in- 
creasing redshift; at the same time, the bias parameter of the 
occupied haloes is increasing. The interplay between these 
effects can give rise to features in the comoving correlation 
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Figure 3. The evolution of the comoving correlation length 
of galaxies out to z ~ 1. The lines show the predictions of the 
semi-analytic model, usingthe bias parameter given by eqn. (jij) 
(solid line) and by eqn. feh (dashed line.) In (a), galaxies are 
selected with apparent magnitude ma < 21.5 and absolute mag- 
nitude Mr < —20 — 2. The points show a preliminary analysis 
of clustering in the CNOC2 field survey (Carlberg et al. 1999). 
In (b), galaxies are selected in the observer frame /-band with 
17.5 < Iab < 22.5. The squares show the correlation length in 
comoving units derived from the CFRS by LeFevre et al. (1996) 
for O = 1 (filled) and £7 = (open) . The filled circle shows the cor- 
relation length inferred from the angular clustering over a larger 
area by Postman et al. (1998). 



length, such as the small peak at z ~ 2 in the ma < 23.5 
sample. 

The evolution of the galaxy clustering length is very 
different from that of the dark matter, shown by the filled 
squares in Fig. [](a). For the cases illustrated here, the co- 
moving correlation length of galaxies at z = 4 lies in the 
range xo — 5-5.5/i -1 Mpc, whereas xo = 0.5/i~ 1 Mpc for the 
dark matter. The dotted lines in Fig. ^ show the e-model for 
various values of the parameter e. Evidently, there is no sin- 
gle value of this parameter that gives an acceptable match 
to the galaxy data over the whole redshift range, a conclu- 
sion suggested by other workers using different techniques 
(Giavalisco et al. 1998; Connolly, Szalay & Brunner 1998; 
Matarrese et al. 1997; Moscardini et al. 1998). 

In FigJ^(b), we show how the evolution of the correla- 
tion length is altered when the effects of obscuration by dust 
are included (see Cole et al. in preparation). For redshifts 
z > 2, the correlation length shows less evolution when dust 
is included, though the differences are relatively minor. 

The dashed and solid lines in Fig.^|(b) show the pre- 
dictions of a simple clustering model which is often used 
in the absence of a physical description of galaxy forma- 



tion. In this model, a one-to-one correspondence between 
galaxy luminosity and the mass of the host dark matter 
halo is assumed. Haloes are populated with single galaxies 
down to some minimum mass, chosen in order to match an 
observed galaxy number density (see for example Moscar- 
dini et al. 1998; Mo, Mao & White 1999; Adelberger etal 
1998). In Fig. |^(b), the minimum halo mass at each red- 
shift is set in order to match the number density of galaxies 
in the semi-analytic model. The solid line shows the results 
of applying the simple model to match the number den- 
sity in the semi-analytic model with dust; the dashed line 
shows the case when dust is neglected. Note that the pre- 
dicted number densities of galaxies are different in these two 
cases for z > 1. The model with dust gives one galaxy per 
square arcminute that satisfy the colour selection used to 
isolate 'Lyman-break' galaxies, in agreement with the ob- 
served value (Adelberger et al. 1998). 

The "one-galaxy-per-halo" model displays qualitatively 
similar clustering evolution to that predicted by the semi- 
analytic model. However, quantitatively, the simple model 
seriously underpredicts the correlation length at low red- 
shifts, where haloes that host more than one galaxy make a 
significant contribution to the clustering. At high redshifts, 
where one might at first expect the simple model to be more 
appropriate, the clustering length can be up to a factor of 
two greater than in the semi-analytic model, when the ef- 
fects of dust are included. In the semi-analytic model, we 
find that the mean UV luminosity of the central galaxy in 
a dark matter halo increases with halo mass, though there 
is scatter in this relationship. This means that there is not 
a sharp transition in mass between haloes that never con- 
tain galaxies bright enough to be included in the sample 
and haloes for which the mean galaxy occupation number 
is unity. In addition at high redshift, the host haloes are 
more massive than M*, so their abundance falls exponen- 
tially with increasing mass. The interplay between these ef- 
fects means that haloes for which the mean galaxy occupa- 
tion number is less than unity can make a significant contri- 
bution to the clustering amplitude. Assuming that there is 
exactly one galaxy per halo and that the dark matter halo 
abundance matches a given galaxy abundance, results in too 
large a dark halo mass being assigned to each galaxy; conse- 
quently the "one-galaxy-per-halo" model overestimates the 
correlation strength. 

We compare the predicted clustering of galaxies in our 
model with observational data out to z ~ 1 in Fig. |j| In the 
top panel, the data points come from a preliminary anal- 
ysis of clustering in the CNOC2 field survey by Carlberg 
et al. (1999). These authors consider galaxies brighter than 
the spectroscopic limit of their survey, ma = 21.5, and use 
the measured redshift of each galaxy to estimate an abso- 
lute magnitude, M. Galaxies brighter than M = —20 — z 
are then selected, where the redshift dependence represents 
an approximate correction for band shifting and evolution. 
We apply the same selection criteria to galaxies in our semi- 
analytic model. The solid line in Fig. |j| shows the predicted 
correlation length obtained using equation (^), while the 
dashed line shows the predictions using the modified expres- 
sion for the bias given by equation (^). Fig. [](b) shows the 
model predictions for galaxies with observer frame J-band 
magnitudes in the range 17.5 < Iab < 22.5. The squares 
show the comoving correlation lengths estimated from the 
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Canada-France Redshift Survey (CFRS) by LeFevre et al. 
(1996), for two different assumptions for the underlying cos- 
mology. The solid circle comes from Postman et al. (1998) 
who analysed angular correlations over a much larger solid 
angle, applying a similar magnitude selection to that in the 
CFRS. The correlation length they obtain is a factor of 2 
larger than the CFRS value. Postman et al. argue that the 
CFRS fields are too small and that sampling fluctuations of 
this magnitude are expected in fields of this size. Our model 
predictions agree very well with the results of Postman et 
al. 



4 DISCUSSION AND CONCLUSIONS 

We have shown that the application of simple selection cri- 
teria (e.g. an apparent magnitude limit) can lead to a non- 
monotonic dependence of clustering strength on redshift. 
The often used "one-galaxy-per-halo" model predicts clus- 
tering evolution that is qualitatively similar to that pre- 
dicted by the semi-analytic model. However this simple 
model can give results that are quite wrong in detail. The rel- 
atively complex behaviour of the galaxy clustering strength 
with redshift is not reproduced by the e-model which, fur- 
thermore, offers no insight into the physical processes that 
drive the evolution of clustering. 

Our results and those of Benson et al. (1999) indicate 
that a cold dark matter model that agrees well with galaxy 
clustering in the local universe also agrees well with clus- 
tering data at high redshift. In particular our fio = 0.3, 
Ao = 0.7, as = 0.94 cold dark matter model matches the 
APM correlation function at low redshift (Benson et al. 
1999), the CNOC2 (Carlberg et al. 1999) and Postman et 
al. (1998) data at z<J0.5 and the Lyman-break galaxy data 
(Adelberger et al. 1998) at z ~ 3 (Baugh et al. 1998; Gov- 
ernato et al. 1998). 

There is little doubt that forthcoming large-area imag- 
ing surveys and catalogues of galaxy redshifts will lead to an 
explosion in the quantity of data bearing on the evolution 
of galaxy clustering. It is unclear whether these data will 
constrain cosmological parameters or the nature of galaxy 
evolution. In either case, the modelling techniques used in 
this paper, and those used in related studies (Governato et 
al. 1998; Benson et al. 1999; Kauffmann et al. 1998, 1999), 
will play an important role in the interpretation of the data. 
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